Weibull regression with Bayesian variable selection to identify prognostic tumour markers of breast cancer survival.
نویسندگان
چکیده
As data-rich medical datasets are becoming routinely collected, there is a growing demand for regression methodology that facilitates variable selection over a large number of predictors. Bayesian variable selection algorithms offer an attractive solution, whereby a sparsity inducing prior allows inclusion of sets of predictors simultaneously, leading to adjusted effect estimates and inference of which covariates are most important. We present a new implementation of Bayesian variable selection, based on a Reversible Jump MCMC algorithm, for survival analysis under the Weibull regression model. A realistic simulation study is presented comparing against an alternative LASSO-based variable selection strategy in datasets of up to 20,000 covariates. Across half the scenarios, our new method achieved identical sensitivity and specificity to the LASSO strategy, and a marginal improvement otherwise. Runtimes were comparable for both approaches, taking approximately a day for 20,000 covariates. Subsequently, we present a real data application in which 119 protein-based markers are explored for association with breast cancer survival in a case cohort of 2287 patients with oestrogen receptor-positive disease. Evidence was found for three independent prognostic tumour markers of survival, one of which is novel. Our new approach demonstrated the best specificity.
منابع مشابه
برتری روش بیزی در تحلیل بقای 8 ساله سرطان پستان و تعیین عوامل موثر بر آن در شهر یزد
Introduction: Breast cancer is one of the common diseases among women with various factors involved in its development. The aim of this study was to determine the factors affecting the survival of women with breast cancer in Yazd using Cox's model as Bayesian and Classic. Method: A population-based study of 538 breast cancer women registered in the clinical database of the Ramezanzade Radiothe...
متن کاملمقایسه مدلهای بیزی پارامتریک در تحلیل عوامل مؤثر بر میزان بقای بیماران مبتلا به سرطان معده
Background & Objectives: The Cox proportional-hazards regression and other parametric models model have achieved widespread use in the analysis of time-to-event data with censoring and covariates. However employing Bayesian method has not been widely used or discussed. The aim of this study was to evaluate the prognostic factors in using Bayesian interval censoring analysis.Methods: This cohort...
متن کاملBayesian Grouped Variable Selection
Traditionally, variable selection in the context of linear regression has been approached using optimization based approaches like the classical Lasso. Such methods provide a sparse point estimate with respect to regression coefficients but are unable to provide more information regarding the distribution of regression coefficients like expectation, variance estimates etc. In the recent years, ...
متن کاملExtracting Predictor Variables to Construct Breast Cancer Survivability Model with Class Imbalance Problem
Application of data mining methods as a decision support system has a great benefit to predict survival of new patients. It also has a great potential for health researchers to investigate the relationship between risk factors and cancer survival. But due to the imbalanced nature of datasets associated with breast cancer survival, the accuracy of survival prognosis models is a challenging issue...
متن کاملIdentification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Net-work Analysis
Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Statistical methods in medical research
دوره 26 1 شماره
صفحات -
تاریخ انتشار 2017